Evaluation Metrics Trade-off for Recommendation Systems
ثبت نشده
چکیده
Following the famous Netflix Prize competition, accuracy metrics (such as RMSE) became a de facto standard in the community to evaluate and compare performance of recommendation algorithms. However, it has been shown, that accuracy is not the best way to quantify the quality of recommendations, especially in the context of predicting online performance from offline evaluations [1]. Thus, it makes sense to look at other metrics, such as novelty, coverage, or serendipity, and in particular, trade-off between these metrics [2]. In this project, a student will work with large news recommendation datasets, collected online on swissinfo.ch and lepoint.fr. The goal would be to implement a number of recommender algorithms with tunable hyperparameters, and generate trade-offs between different evaluation metrics by varying these hyperparametrs. A student will then analyse these trade-offs by plotting trade-off curves, comparing them for various algorithms, examining the area under the curves, and possibly looking at their impact on CTR and online success rate.
منابع مشابه
A multi-objective resource-constrained optimization of time-cost trade-off problems in scheduling project
This paper presents a multi-objective resource-constrained project scheduling problem with positive and negative cash flows. The net present value (NPV) maximization and making span minimization are this study objectives. And since this problem is considered as complex optimization in NP-Hard context, we present a mathematical model for the given problem and solve three evolutionary algorithms;...
متن کاملOptimizing the Recency-Relevancy Trade-off in Online News Recommendations
Online news media sites are emerging as the primary source of news for a large number of users. The selection of ‘frontpage’ stories on these media sites usually takes into consideration several crowdsourced popularity metrics, such as number of views or shares by the readers. In this work, we focus on automatically recommending front-page stories in such media websites. When recommending news ...
متن کاملEvaluating the effectiveness of explanations for recommender systems Methodological issues and empirical studies on the impact of personalization
When recommender systems present items, these can be accompanied by explanatory information. Such explanations can serve seven aims: effectiveness, satisfaction, transparency, scrutability, trust, persuasiveness, and efficiency. These aims can be incompatible, so any evaluation needs to state which aim is being investigated and use appropriate metrics. This paper focuses particularly on effecti...
متن کاملAssessing the Trade-Off between System Building Cost and Output Quality in Data-to-Text Generation
Data-to-text generation systems tend to be knowledge-based and manually built, which limits their reusability and makes them time and cost-intensive to create and maintain. Methods for automating (part of) the system building process exist, but do such methods risk a loss in output quality? In this paper, we investigate the cost/quality trade-off in generation system building. We compare six da...
متن کاملDimensions and Metrics for Evaluating Recommendation Systems
recommendation systems support users and developers of various computer and software systems to overcome information overload, perform information discovery tasks and approximate computation, among others. They have recently become popular and have attracted a wide variety of application scenarios from business process modelling to source code manipulation. Due to this wide variety of applicati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015